|
|
Accession Number |
TCMCG075C10084 |
gbkey |
CDS |
Protein Id |
XP_007038668.2 |
Location |
complement(join(28676700..28676703,28676897..28677213,28677353..28677823,28678079..28678228,28679026..28679097,28679180..28679440,28679576..28679707,28679814..28680077,28680464..28680715,28680808..28680906,28681444..28681608,28681917..28682102,28682694..28683668)) |
Gene |
LOC18605549 |
GeneID |
18605549 |
Organism |
Theobroma cacao |
|
|
Length |
1115aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007038606.2
|
Definition |
PREDICTED: DNA mismatch repair protein MSH3 [Theobroma cacao] |
CDS: ATGGGGAAGCAAAAGCAACAAGTCATTTCTCGTTTTTTTGCCCCCAAACCCAAAACCCCATCCACCCCAACTCCACCAGCAAACCCTTCATCTTCTCCGTCTCCTCCTTCACCGCCAATCCCATCACCCAACGTAAAAGCAACTGTCTCTTTTTCCCCTTCAAAGCGCAAACTCCTCTCAACCCACCTCACTTCCACTCCTAAGAAACCCAAAACCACGCTTTCACCTCACACCCACAACCCCGTTCCTCTTCAGTCTAATCCTTCCCTCCACCAAAAATTCCTCCACAAACTTCTGGAACCTTCTCCACGACGTCCGCTTGAACCTACCGTCGAACTTTCCGGATCCGACCACAAAAAGTACACCCCACTAGAACAACAAGTGGTGGATTTAAAAAACAAATACCCGGATGTTCTTCTCATGGTGGAAGTCGGTTACAGGTTCCGATTCTTCGGGAAGGATGCGGAAATCGCGGCGAAAGAGTTGGGAATATATGCCCACGTGGACCGCAACTTCTTAACGGCTAGCGTACCTACTTTTCGACTGAATGTCCACGTGAGGAGGCTGGTCAGTGCGGGATACAAGGTTGGTGTGGTGAAACAGACAGAAACGGCGGCGATTAAGGCGCATGGTTCGAACCGAGTTGGACCGTTTTGCAGGGGTTTGTCGGCATTATACACGAAGGCTACGCTGGAGGCCGCGGAGGATGTGGGAGGGAAAGAGGAAGGGTGTGGTGGAGAGAGTAATTATTTGGTTTGCGTTGTGGAGAAAGGTTTGGAGTTTTCGGGGTCTGTTTCAGGTTCTGGTGCGGTTGATGTGAGGGTTGGAATTGTTGGAGTGGAGATTTCAACGGGGGATGTTGTTTATGGGGAGTTTGATGATGGAGTTATGAGGAGCGGGCTTGAAGCTGTGGTTTTTAGCTTGGCTCCCGCTGAGTTATTGGTTGGAGAACCGCTTTCGAAACAAACAGAAAAGTTGTTATTGGCATATGCTGGACCTGCTTCAAATGTTCGTCTGGAGCATGCCTCTTGTGATTGTTTCAAGGGTGGTGGCGCACTTGCGGAAGTGATGTCTGTGTATGAGAAGATGGTTGAAGATAATTTAGCCAGTAATGTGAATCAGTCATTGGAGGCAACAGAATATTCTCACTCTTCAATTCAGGGGGTTATGAACATGCCAGATTTGGCTTTACAAGCTTTGGCCTTAACCATTCGTCATCTCAAGCAATTTGGATTTGAAAGAATTGTGTGCCTTGAAGCTTCATTTCGTTCCTTATCAAGCAGTTTGGAGATGAATCTTTCAGCAAATACACTTCAACAATTAGAGATTTTGAGGAATAATTCAGATGGGTCTGAATCTGGCTCCTTGCTGCAAATTATGAACCATACTCTTACTATTTATGGATCAAGGCTTCTTAGACACTGGGTGACTCATCCTTTATGTGATAGAACCATGATATCTGCTCGACTTGATGCTGTTTCTGAAATTGCTTTGTCCATGGGGTGTTATAAAGTCTCACAAAGTATCATTGAGATAGATGGGGAAGATTCTGATGTGACCATTGCACAACCAGAATTCTACTCTGTGCTTTCCTCAGTTTTAACTTTTTTAGGAAGATCACCTGATATTCAGCGTGGAATAACAAGAATCTTCCATCGAACTGCCACCCCAGCAGAGTTCATTGCAGTTATTCAAGCTATTTTATCTGCTGGAAAGCAACTTCAGCGGCTTCATATTGATGAAGAACATGAAGACAATTGCAGTAAGAAAGTGCGAGTAGGGATTGTGCAGTCAGCTCTGTTGAAAAGGTTGATTTTGACTGCTTCATCATCCAATGTTCTTGGCAATGCTGCAAAACTGCTATCTTTCCTAAACAAAGAAGCAGCTGATAAAGGGGATTTAACAAACTTAATCATCATTTCTAACAACCAATTTCCGGAGGTTGCTAGAGCTAGGAAAGCAGTTCAATTGGCGAAGGAGAAACTGGATAACTTGATTTTCTTGTATAGAAAGCGACTTGGGAAAGGCAATTTGGAATTTATGTGTGTGTCAGGAACCACACATTTGATAGAGCTACCCATAGATGCCAATGTACCTTCAAACTGGGTTAAGGTAAATAGTACCAAAAAGACAATAAGGTATCATCCGCCTGAAGTATTGACTGCTCTAGACCAGTTAACACTGGCAAATGAAGAGCTCACCATTATCTGTCGAGCTGCTTGGGACAGCTTTCTTAGGGAATTTGGTGAATATTACTCCGAGTTTCAAGCTGCTGTTCAAGCACTTGCTGCTTTGGACTGTTTGCACTCTCTTGCCACTCTCTCAAGAAATAAGAATTATGTTCGGCCTATCTTTGTGGATGACAATGAACCTGTTCAGATACAAATCCACTCCGGTCGTCACCCTGTGTTGGAGACCATCTTACAAGAGGGTTTTGTTCCAAATGACACAACATTGCATGCAGACAGGGAGTGTTGTCAGATTGTTACTGGTCCTAATATGGGTGGAAAGAGTTGCTACATTCGCCAGGTTGCACTAATTGCAATGATGGCTCAGGTTGGTTCCTTTGTACCAGCAGCATCAGCTACTTTGCATGTGTTAGATGCTATCTACACACGCATGGGTGCTTCTGACAGTATACAACAAGGGAGAAGTACATTTCTAGAAGAACTAAGTGAGGCTTCTCAAATACTCCACAGCTGCACAGCACGCTCACTGGTTGTAATTGATGAGCTTGGAAGAGGAACTAGTACACATGATGGTGTATCTATTGCTTATGCTACATTACATCATCTGTTGGAGCAGAGAAAATGCATGGTCCTCTTTGTAACCCACTACCCTAGAATTGCTGATATTAAAGTTGAATTTCCTGGTTCTGTGGAGGTATATCATGTTTCATATCTGACTGCACATAATGATGAGGTTACTATGGATGCAAAATCTGATCATGAAGTCACGTACCTATATAAGCTTGTTCCTGGTGTTTCTGCAAGGAGTTTTGGATTCAAGGTTGCACAGCTTGCCCAGCTGCCTTCATCATGCATCAGTCAAGCAATTATCATGGCTACAAGGCTGGAAGCAATTGAAAGCAGCAGAGTGAGAAAGAAATCAGAAGAAAGGCAGCCAGAAACATCATCGAGTGATCAAGAACTAGAAACACAAGAGAACATACTGAAATCCATTGGTAGCTTCTCCAGTGAAAGGCTAGAGAATTTAGAAGAATTTGCCAGTGCTTTCAGTGACTTGCTTTTGAACTTGAAATCTGCAAGAACGGATGATGACCTTGGCAAAAGCTTTCAGTTATTGAAAGAGGCTAGAAGCATTGCAAAGGAATTGATAAACAGATAA |
Protein: MGKQKQQVISRFFAPKPKTPSTPTPPANPSSSPSPPSPPIPSPNVKATVSFSPSKRKLLSTHLTSTPKKPKTTLSPHTHNPVPLQSNPSLHQKFLHKLLEPSPRRPLEPTVELSGSDHKKYTPLEQQVVDLKNKYPDVLLMVEVGYRFRFFGKDAEIAAKELGIYAHVDRNFLTASVPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNRVGPFCRGLSALYTKATLEAAEDVGGKEEGCGGESNYLVCVVEKGLEFSGSVSGSGAVDVRVGIVGVEISTGDVVYGEFDDGVMRSGLEAVVFSLAPAELLVGEPLSKQTEKLLLAYAGPASNVRLEHASCDCFKGGGALAEVMSVYEKMVEDNLASNVNQSLEATEYSHSSIQGVMNMPDLALQALALTIRHLKQFGFERIVCLEASFRSLSSSLEMNLSANTLQQLEILRNNSDGSESGSLLQIMNHTLTIYGSRLLRHWVTHPLCDRTMISARLDAVSEIALSMGCYKVSQSIIEIDGEDSDVTIAQPEFYSVLSSVLTFLGRSPDIQRGITRIFHRTATPAEFIAVIQAILSAGKQLQRLHIDEEHEDNCSKKVRVGIVQSALLKRLILTASSSNVLGNAAKLLSFLNKEAADKGDLTNLIIISNNQFPEVARARKAVQLAKEKLDNLIFLYRKRLGKGNLEFMCVSGTTHLIELPIDANVPSNWVKVNSTKKTIRYHPPEVLTALDQLTLANEELTIICRAAWDSFLREFGEYYSEFQAAVQALAALDCLHSLATLSRNKNYVRPIFVDDNEPVQIQIHSGRHPVLETILQEGFVPNDTTLHADRECCQIVTGPNMGGKSCYIRQVALIAMMAQVGSFVPAASATLHVLDAIYTRMGASDSIQQGRSTFLEELSEASQILHSCTARSLVVIDELGRGTSTHDGVSIAYATLHHLLEQRKCMVLFVTHYPRIADIKVEFPGSVEVYHVSYLTAHNDEVTMDAKSDHEVTYLYKLVPGVSARSFGFKVAQLAQLPSSCISQAIIMATRLEAIESSRVRKKSEERQPETSSSDQELETQENILKSIGSFSSERLENLEEFASAFSDLLLNLKSARTDDDLGKSFQLLKEARSIAKELINR |